555 research outputs found
Green Algorithms: Quantifying the Carbon Footprint of Computation.
Climate change is profoundly affecting nearly all aspects of life on earth, including human societies, economies, and health. Various human activities are responsible for significant greenhouse gas (GHG) emissions, including data centers and other sources of large-scale computation. Although many important scientific milestones are achieved thanks to the development of high-performance computing, the resultant environmental impact is underappreciated. In this work, a methodological framework to estimate the carbon footprint of any computational task in a standardized and reliable way is presented and metrics to contextualize GHG emissions are defined. A freely available online tool, Green Algorithms (www.green-algorithms.org) is developed, which enables a user to estimate and report the carbon footprint of their computation. The tool easily integrates with computational processes as it requires minimal information and does not interfere with existing code, while also accounting for a broad range of hardware configurations. Finally, the GHG emissions of algorithms used for particle physics simulations, weather forecasts, and natural language processing are quantified. Taken together, this study develops a simple generalizable framework and freely available tool to quantify the carbon footprint of nearly any computation. Combined with recommendations to minimize unnecessary CO2 emissions, the authors hope to raise awareness and facilitate greener computation
Known allosteric proteins have central roles in genetic disease
Allostery is a form of protein regulation, where ligands that bind sites
located apart from the active site can modify the activity of the protein. The
molecular mechanisms of allostery have been extensively studied, because
allosteric sites are less conserved than active sites, and drugs targeting them
are more specific than drugs binding the active sites. Here we quantify the
importance of allostery in genetic disease. We show that 1) known allosteric
proteins are central in disease networks, and contribute to genetic disease and
comorbidities much more than non-allosteric proteins, in many major disease
types like hematopoietic diseases, cardiovascular diseases, cancers, diabetes,
or diseases of the central nervous system. 2) variants from cancer genome-wide
association studies are enriched near allosteric proteins, indicating their
importance to polygenic traits; and 3) the importance of allosteric proteins in
disease is due, at least partly, to their central positions in protein-protein
interaction networks, and probably not due to their dynamical properties
Recommended from our members
Towards clinical utility of polygenic risk scores.
Prediction of disease risk is an essential part of preventative medicine, often guiding clinical management. Risk prediction typically includes risk factors such as age, sex, family history of disease and lifestyle (e.g. smoking status); however, in recent years, there has been increasing interest to include genomic information into risk models. Polygenic risk scores (PRS) aggregate the effects of many genetic variants across the human genome into a single score and have recently been shown to have predictive value for multiple common diseases. In this review, we summarize the potential use cases for seven common diseases (breast cancer, prostate cancer, coronary artery disease, obesity, type 1 diabetes, type 2 diabetes and Alzheimer's disease) where PRS has or could have clinical utility. PRS analysis for these diseases frequently revolved around (i) risk prediction performance of a PRS alone and in combination with other non-genetic risk factors, (ii) estimation of lifetime risk trajectories, (iii) the independent information of PRS and family history of disease or monogenic mutations and (iv) estimation of the value of adding a PRS to specific clinical risk prediction scenarios. We summarize open questions regarding PRS usability, ancestry bias and transferability, emphasizing the need for the next wave of studies to focus on the implementation and health-economic value of PRS testing. In conclusion, it is becoming clear that PRS have value in disease risk prediction and there are multiple areas where this may have clinical utility
Ten simple rules to make your computing more environmentally sustainable.
Funder: Victorian Government’s Operational Infrastructure Support (OIS) programFunder: Health Data Research UKFunder: La Trobe University Postgraduate Research ScholarshipFunder: Munz Chair of Cardiovascular Prediction and Preventio
Visualizing Chromosome Mosaicism and Detecting Ethnic Outliers by the Method of “Rare” Heterozygotes and Homozygotes (RHH)
We describe a novel approach for evaluating SNP genotypes of a genome-wide association scan to identify “ethnic outlier” subjects whose ethnicity is different or admixed compared to most other subjects in the genotyped sample set. Each ethnic outlier is detected by counting a genomic excess of “rare” heterozygotes and/or homozygotes whose frequencies are low (<1%) within genotypes of the sample set being evaluated. This method also enables simple and striking visualization of non-Caucasian chromosomal DNA segments interspersed within the chromosomes of ethnically admixed individuals. We show that this visualization of the mosaic structure of admixed human chromosomes gives results similar to another visualization method (SABER) but with much less computational time and burden. We also show that other methods for detecting ethnic outliers are enhanced by evaluating only genomic regions of visualized admixture rather than diluting outlier ancestry by evaluating the entire genome considered in aggregate. We have validated our method in the Wellcome Trust Case Control Consortium (WTCCC) study of 17,000 subjects as well as in HapMap subjects and simulated outliers of known ethnicity and admixture. The method's ability to precisely delineate chromosomal segments of non-Caucasian ethnicity has enabled us to demonstrate previously unreported non-Caucasian admixture in two HapMap Caucasian parents and in a number of WTCCC subjects. Its sensitive detection of ethnic outliers and simple visual discrimination of discrete chromosomal segments of different ethnicity implies that this method of rare heterozygotes and homozygotes (RHH) is likely to have diverse and important applications in humans and other species
Power, false discovery rate and Winner's Curse in eQTL studies.
Investigation of the genetic architecture of gene expression traits has aided interpretation of disease and trait-associated genetic variants; however, key aspects of expression quantitative trait loci (eQTL) study design and analysis remain understudied. We used extensive, empirically driven simulations to explore eQTL study design and the performance of various analysis strategies. Across multiple testing correction methods, false discoveries of genes with eQTLs (eGenes) were substantially inflated when false discovery rate (FDR) control was applied to all tests and only appropriately controlled using hierarchical procedures. All multiple testing correction procedures had low power and inflated FDR for eGenes whose causal SNPs had small allele frequencies using small sample sizes (e.g. frequency 25%). Overestimation of eQTL effect sizes, so-called 'Winner's Curse', was common in low and moderate power settings. To address this, we developed a bootstrap method (BootstrapQTL) that led to more accurate effect size estimation. These insights provide a foundation for future eQTL studies, especially those with sampling constraints and subtly different conditions
FastSpar: rapid and scalable correlation estimation for compositional data.
SUMMARY: A common goal of microbiome studies is the elucidation of community composition and member interactions using counts of taxonomic units extracted from sequence data. Inference of interaction networks from sparse and compositional data requires specialized statistical approaches. A popular solution is SparCC, however its performance limits the calculation of interaction networks for very high-dimensional datasets. Here we introduce FastSpar, an efficient and parallelizable implementation of the SparCC algorithm which rapidly infers correlation networks and calculates P-values using an unbiased estimator. We further demonstrate that FastSpar reduces network inference wall time by 2-3 orders of magnitude compared to SparCC. AVAILABILITY AND IMPLEMENTATION: FastSpar source code, precompiled binaries and platform packages are freely available on GitHub: github.com/scwatts/FastSpar. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online
Translating Research Into Practice: Speeding the Adoption of Innovative Health Care Programs
Looks at case studies of four innovative clinical programs to determine key factors influencing the diffusion and adoption of innovations in health care
Recommended from our members
Systems biology and big data in asthma and allergy: recent discoveries and emerging challenges
Asthma is a common condition caused by immune and respiratory dysfunction, and it is often linked to allergy. A systems perspective may prove helpful in unravelling the complexity of asthma and allergy. Our aim is to give an overview of systems biology approaches used in allergy and asthma research. Specifically, we describe recent “omic”-level findings, and examine how these findings have been systematically integrated to generate further insight.
Current research suggests that allergy is driven by genetic and epigenetic factors, in concert with environmental factors such as microbiome and diet, leading to early-life disturbance in immunological development and disruption of balance within key immuno-inflammatory pathways. Variation in inherited susceptibility and exposures causes heterogeneity in manifestations of asthma and other allergic diseases. Machine learning approaches are being used to explore this heterogeneity, and to probe the pathophysiological patterns or “endotypes” that correlate with subphenotypes of asthma and allergy. Mathematical models are being built based on genomic, transcriptomic, and proteomic data to predict or discriminate disease phenotypes, and to describe the biomolecular networks behind asthma.
The use of systems biology in allergy and asthma research is rapidly growing, and has so far yielded fruitful results. However, the scale and multidisciplinary nature of this research means that it is accompanied by new challenges. Ultimately, it is hoped that systems medicine, with its integration of omics data into clinical practice, can pave the way to more precise, personalised and effective management of asthma.This work was supported by the National Health and Medical Research Council (NHMRC) of Australia via a postgraduate scholarship (ref. no. 1114753) to HHF Tang, research grant (1049539) to M Inouye and K Holt, and Fellowships (1061409) to K Holt and (1061435) to M Inouye. K Holt was further supported by a Senior Medical Research Fellowship from the Viertel Foundation of Australia
- …